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The work addresses the analogy between trivial knotting and excluded volume in looped polymer 
chains of moderate length, TV < TVo, where the effects of knotting are small. A simple expression for 
the swelling seen in trivially knotted loops is described and shown to agree with simulation data. 
Contrast between this expression and the well known expression for excluded volume polymers leads 
to a graphical mapping of excluded volume to trivial knots, which may be useful for understanding 
where the analogy between the two physical forms is valid. The work also includes description 
of a new method for the computational generation of polymer loops via conditional probability. 
Although computationally intensive, this method generates loops without statistical bias, and thus 
is preferable to other loop generation routines in the region TV < TVo. 



I. INTRODUCTION: FORMULATION OF THE 
PROBLEM 

The last few years have seen significant work address- 
ing the effects of knotting on looped polymer chains. Of 
interest to mathematicians and physicists for good part 
of nineteenth and most of the twentieth centuries, knots 
were first seen by W. Thomson as a way to understand 
the nature of atoms p] , and more recently as the basis for 
string theory. On the biological front, knots have been 
observed in, 0,0, and tied into, 0,0, strands of DNA. 
Additionally, topoisomerases - proteins which act to al- 
ter the topological state of DNA - are quite common and 
play a significant role in cellular processes. 

The requirements a knot imposes on a strand are hard 
to formulate in a simple way, as "interactions" between 
neighboring strands can require highly non-local changes 
in the coil's conformation to maintain topological state. 

That said, the most obvious effect knotting has on a 
loop is in the size, commonly measured in terms of ra- 
dius of gyration, R 2 g . For instance, the loop topologically 
equivalent to a circle, called a trivial or 0i knot in profes- 
sional parlance, is on average larger than the loop of the 
same length with any other topology. In other words, a 
trivial loop is larger than the phantom loop, the latter 
representing topology-blind average over all loops of a 
certain length: (iZ?V . > (R 2 n ) , , . This topology- 

° \ 9/triv \ 9 ' phantom ^ OJ 

driven swelling is operational even for very thin poly- 
mers, in the limit when volume exclusion has no effect 
on polymer coil size. In this case, the phantom loop's 
size (which is, once again, average over all topologies) 
scales as TV 1 / 2 , while the trivial loop is larger not merely 
because of a larger prefactor, but because of a larger scal- 
ing exponent, its size scales as TV", where v > 1/2. The 
conjecture, formulated a long time ago 0, supported by 
further scaling arguments M, la, and consistent with re- 
cent simulation data |9, Efl HjJ , specifies that the scaling 
exponent v describing topology-driven swelling of a triv- 
ial loop is exactly the same as the Flory exponent [l^ . 
which describes swelling driven by the self-avoidance (or 
excluded volume): v w 0.589 w 3/5. 



Equality of scaling exponents for the two cases reflects 
the similarity of fractal properties for these systems at 
very large TV ^> I, because topological constraints re- 
sult in self-avoidance of blobs on all length scales above 
a certain threshold 0. As we understand much about 
self-avoidance [l^, and next to nothing about knots, we 
would like to exploit the analogy to see if it yields any 
insights into knots. Specifically, it is tempting to look at 
the dependence of the unknotted loop size, \Rg) . . , on 
the number of segments, TV, not only in the asymptotic 
scaling regime of very large TV, but also the corrections to 
scaling at not-so-large TV. This is particularly important 
from a practical standpoint, because the asymptotic scal- 
ing limit is barely accessible computationally, and what 
one really computes is the value of {Rg) at rather 
moderate TV. Systematic comparison of TV-dependencies 
of (Ry) for (trivial) knots and self-avoiding polymers over 
the wide range of TV is the goal of this paper. 

We show that although large TV scaling appears to be 
identical for trivial knots and excluded volume polymers, 
their respective approach to the asymptotic regime is dif- 
ferent. This points obviously to the limited character of 
the analogy between the two mechanisms of swelling, due 
to volume exclusion and due to topological constraints. 

The plan of the paper is as follows. We start from a 
brief summary of the main results for self-avoiding poly- 
mers. Although these results are widely known, we re- 
state them in the form most suitable for our purposes. 
Next, we present some heuristic analytical arguments 
to shed light on why trivial knots may behave differ- 
ently then their excluded volume counterparts. With 
this insight in mind, we present our detailed computa- 
tional data on the TV-dependence of {R*) over the 
wide range of TV. To obtain data with the necessary de- 
gree of accuracy, it is necessary to make sure that our 
method of generating loops is ergodic and unbiased. Al- 
though this aspect is of decisive importance, it is purely 
technical, and thus it is relegated to the Appendix. Up 
to about section lll Cl we mostly review the known results, 
starting from section Til Dl we present our new findings. 
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II. PRELIMINARY CONSIDERATIONS 
A. Swelling driven by self-avoidance: an overview 

To make our work self-contained we now offer a brief 
review of the results for the sca ling of excluded volume 
polymers (see further details in |l3l IbH Il5 | h We should 
emphasize from the beginning that the main properties 
of the excluded volume polymer are valid also for loops 
|l6j| . The simplest model for excluded volume is a system 
in which N beads, each of volume 6, are placed along 
a loop with mean separation I. All other forms of ex- 
cluded volume, e.g. freely jointed stiff rods, worm-like 
filaments, etc., can be mapped to this simple rod-bead 
model (see. e.g., Q). There are two scaling regimes, with 



crossover at the length 

n* ~ (e 3 /b) 2 . (i) 

In terms of N*, the mean squared gyration radius (R 2 .) 
can be written as \Rg) — £ 2 Np(z), where the swelling 
factor p depends on the single variable z = v/ N/N*. For 
classical polymer applications, the large z regime is most 
interesting. p(z) has a branch point singularity in infin- 
ity, its large z asymptotics are dominated by the factor 
2-2Z/-1. noweveri if W e write p(z) = z 2l/ ~ 1 4>(z), then </>(z) 
is analytical in infinity and can be expanded in integer 
powers of l/z. Accordingly, the large N asymptotics of 
(R 2 g ) follow: 
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Conversely, in the region TV <C N* , the approximation for {R 2 g ) is afforded by the fact that p(z) is analytical at small 
z and can be expanded in integer powers of z: 
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where prefactor A' should be equal to unity (which ex- 
plains why we did not absorb the factor of 1/12 into 
A'). Note that the latter result is an intermediate asymp- 
totics, which means the corresponding region exists only 
so long as TV* 3> 1 is large, which means excluded volume 
is sufficiently small. 



B. Swelling driven by topology: cross-over length 

With this brief summary of results in mind we now set 
forward, intending to systematically compare the com- 
putational results for the behavior of trivial knots to the 
well-understood polymer with excluded volume. 

To look at the analogy between self-avoiding polymers 
and trivial knots, it is useful to start, 0, by identify- 
ing the cross-over length for knots, an analog of N* |QJ, 
which we call Nq. For knots, it is natural to identify 
the cross-over value of N with the so-called character- 
istic length of random knotting, No; the latter quantity 
is known as the characteristic length of the exponential 
decay of probability, wtriv(N), of formation of a triv- 
ial knot upon random closure of the polymer ends |17| : 
Wtriv — exp(— N/Nn). Depending on the specifics of the 
model used, 0, 0, , the critical length varies sub- 
tly around Nq w 300. It is also clear qualitatively y| and 
seen computationally ^lj that this Nq is about the length 



at which topological effect on loop swelling crosses over 
from marginality at N < No to significance at N > No. 
In particular, it is at N > No that the trivial knot be- 
gins to swell noticeably beyond the size of the phantom 
polymer [Til ]. 



C. Swelling driven by topology: above the 
cross-over 

A number of groups reported observation of the power 
v 3/5 in the scaling of trivial ITolllll ITflj and other 
topologically simple [3, ITol Hl| knots in the region N > 
N . 

In the works 0, 0, |2(j , following the idea suggested 
in [22^. the N dependence of (Rg) triv was fitted to the 
formula similar to equation @ for self-avoiding polymers. 
No attempt was made at physical interpretation of the 
best fit values of the three coefficients (A, ki, fcg) or the 
region of N where the fit was examined. In this sense, 
fitting with equation J2J was only used as an instrument 
to find the scaling exponent v, which in these works was 
found to be strikingly consistent with the expected value 
of the self-avoidance exponent. A puzzling aspect of the 
situation is that, particularly in the work [lOj, the data 
was fit to equation J2J n °t on ly m the region N > No, 
but across the crossover, starting from about Aq/3 to 
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about 3N (see also [20j1. 

At present we are aware of no studies which provide a 
detailed comparison of excluded volume and trivial knot- 
ting at modest N < No. Seeking to further appraise the 
analogy between trivial knotting and excluded volume, 
in the present work we address the two systems in the 
region below their respective crossovers. 



D. Swelling driven by topology: below the 
cross-over 

Formula (J3J is the result of perturbation theory [l5| . 
in which conformations with overlapping segments repre- 
sent a small part of conformational space and their exclu- 
sion is considered a small correction to Gaussian statis- 
tics. It is tempting to try a similar approach for knots. 
The idea would be to note that at small N < Nq, the 
probability of a non-trivial knot is small, which implies 
that restricting the loop such that it remains a trivial 
knot excludes only a small sector of the conformation 
space which therefore, comprises a small correction to 
Gaussian statistics. 

Let us try to imagine the realization of this idea. We 
want to find the swelling ratio of the trivial loop: 



po* = (*X 



/( r2 9 ), t 

• 1 \ y I phantom 



(4) 



We know that the (topology blind) ensemble average over 
all knots must, by definition, yield unity for the swelling 
ratio: 



(5) 



where Pi and pi are, respectively, the probability and 
swelling ratio of the knot i. Our plan is to consider for- 
mula 10 as the equation from which we can determine 
the quantity of interest, po t : 



POl 



1 - f3iP3i - P^ PA, 

Pa, 



(6) 



To this point our consideration is exact, but now we 
switch to hand waving arguments and guesses justified by 
the simulation data. In the range of small N, the ensem- 
ble of loops consists mostly of Oi knots, perturbed slightly 
by the presence of 3i and higher-order or more complex 
knots. We consider then N/No as a small parameter: 
N/N « 1. Of course, in the case of excluded volume, 
the similar limit is better justified, because N*, equation 
|JT)|. can at least in principle, be arbitrarily large, leaving 
room for the intermediate asymptotics 1 <C N <C N* . In 
the case of knots, Nq is as large as about 300, but so far 
we do not know why it is large, and it seems beyond our 
control to make it larger. Accordingly, we cannot speak 
of an intermediate asymptotics in a mathematically rig- 
orous way |2^]. Nevertheless, we assume here that the 
numerically large value of No allows us hope that the 
asymptotic argument is possible, and so we assume that 



N/Nq is a small parameter. We guess then that higher 
order knots provide only higher order perturbation cor- 
rections with respect to this parameter, and we neglect 
their contributions, simplifying the ensemble by account- 
ing for only Oi and 3i knots. In this case, Pg 1 + P^ 1 ~ 1. 
This is justified by the data presented in Figure^ which 
shows that higher knots are very rare indeed. Since we 
know that Pqi — ex P(~ N/Nq), we can also find P-s x . 
Given that we consider the N/Nq <§C 1 regime, we must 
also linearize the exponent, which yields: 



POi 



1-(1-P 0l )p 3l i-(i-e-"/"°)p 3l 

P Qi ~ e -N/N 

(l-(N/No)p 3l ){l + N/No) . 



(7) 



The next step requires thinking about p^ . In prin- 
ciple, we can come up with a chain of equations, not 
unlike the BBGKI chain in the theory of fluids, express- 
ing in terms of higher knots, etc. A more practi- 
cal course is to note that for the lowest order in pertur- 
bation, with respect to the supposedly small parameter 
N/Nq, since pz x has already the small (N/Nq) coefficient 
in front of it, it is enough to replace p^ with a constant 
at N/N -> 0. Thus, to the lowest order in N/N < 1 we 
get (N/Nq) p 3l ~ (N/N Q ) c, where c is a constant. We 
therefore finally obtain 



POl ~ 1 + (N/N ) (1 - c) 



or 



(4) 
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(8) 



(9) 



The difference between equations © and @ is imme- 
diately obvious: the former is an expansion in powers of 
y/N, the latter starts from the first power of N. The y^N 
term does not occur in our expansion for knots. Note that 
the values of the k[ coefficients in equation (J3J) arc known 
[l5| . and this prevents the easy (and incorrect) explana- 
tion that k'i = 0. As regards the value of coefficient c, 
we do not have at present an analytical means to calcu- 
late it, we will later estimate it based on the simulation 
data. Thus, despite identical scaling index at large N, 
trivially knotted and excluded volume polymers exhibit 
a very different mathematical structure of iV-dependcncc 
in their respective gyration radii in the region of small N. 

It is possible that another manifestation of the same 
difference is the fact that data in the work were suc- 
cessfully fitted to the equation © across the crossover 
region, where this formula for the self-avoiding polymers 
is not supposed to work. 

Thus, our considerations suggest that there is some 
fundamental difference between topology and self- 
avoidance in terms of their respective effects on the 
swelling at moderate N. In what follows, we present 
computational tests supporting and further developing 
this conclusion. 
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fractional abundance of simplest knots swelling beyond average gyration radius of fixed-steplength loops 




FIG. 1: Fractional abundance of Oi, 3i, and 4i knots within 
the ensemble of all looped polymer chains of fixed steplength. 
The Oi abundance follows the well known, [a. Il7l Hs|l . expo- 
nential decay, A exp (— N/No) with decay length An = 255 
and prefactor A ~ 1.05. Pertinent to the notion of higher- 
order knots acting as a perturbation is that the abundance of 
3i and 4i knots, seen in the inset, is quite low in the N <C An 
region. 

III. MODEL AND SIMULATION METHODS 

We model polymer loops as a set of N + 1 vertices, Xi, 
embedded in 3D, where xq = xn implies loop closure. 
The step between successive vertices, yi = Xi+\ — Xi is 
constructed either from steps of fixed length, with prob- 
ability density 

P(yi) = ^HWi\-i), (io) 

or Gaussian distributed, with probability density 

™ - G^r-p (-^ ■ 

Note that £, the "average" steplength, is defined, I 2 — 
j P(y)y 2 d 3 y. Many methods have been used to generate 
loops in computer simulation over the past decade. A 
brief review of the methods is available in Appendix lAl 
the details of the method implemented in this work are 
presented in Appendix iBl 

Once generated, we asses the loop's size by calculating 
its radius of gyration 

^ = ^E^- f /- ( 12 ) 

The mean square average radius of gyration seen over 
all loops is, (R 2 g ) = ±{N + f3)l 2 , where = 1 for fixed 
steplength loops and (3 = —1/N for loops of gaussian 
distributed steplength. Noting that the excluded volume 
constraint is maintained by the condition that pair dis- 
tances be larger than excluded volume bead diameter, 
Tij = \xi — Xj\, Tij > d, we record the minimum for 



N 
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FIG. 2: Direct comparison of excluded volume and trivial 
knot swelling, poi , beyond the phantom average size for loops 
of fixed steplength. Excluded volume is formulated in terms of 
N beads of diameter d, each centered at an universal joint be- 
tween loop segments. Exclusion is maintained by prohibiting 
bead overlap, \xt — Xj\ > d for all i 7^ j. As discussed in Sec- 
tion IHDl and in contrast to the region above their respective 
crossovers, in the small N < No regime, trivial knots follow a 
functional form different from that of excluded volume loops. 

each coil, which enables us to ascertain what maximum 
diameter of excluded volume, d, the loop corresponds to, 
|24|. Finally, we calculate the topological state of the 
loop by computing the Alexander determinant, A(— 1), 
and Vassiliev knot invariants of degree 2 and 3, v2 and 
v3, the implementation of which is described in [25J. As 
the simulation progresses, averages are accumulated in 
a matrix, indexed over different knot types and mini- 
mum pair distances. In the end, we can collect the data 
to find the gyration radius for either a particular knot 
type irrespective of pair distances (i.e., without volume 
exclusion), or for a particular excluded volume value ir- 
respective of topology. 



IV. RESULTS 

A. On the functional form of TV-dependence of the 
gyration radius in the moderate N regime 

Figure [21 provides direct comparison of the computa- 
tionally determined mean square gyration radius for triv- 
ial knots and phantom loops with excluded volume (av- 
eraged over all topologies), in the latter case - for various 
values of the bead diameter. Note that in the figure, the 
gyration radius is expressed with the swelling ratio p, as 
defined in equation © . The most striking feature of this 
figure is the differently shaped curves of swelling. The re- 
gion of intermediate N visible in the figure, 1 < N < Nq, 
shows the plot of trivial knot swelling passing through 
all excluded volume curves. As seen, the very shape of 
the poi curve is different. Specifically, all curves for the 
excluded volume loops are bent downwards, consistent 
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gyration radius swelling for trivially knotted, fixed-steplength loops 



excluded volume bead diameter matching trivial knot swelling at given N 
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FIG. 3: Average gyration radius data for trivially knotted 
loops of fixed steplength. Loops were generated with the 
conditional probability method described in the appendix. 
Swelling of the gyration radius is seen to be linear in the 
small A regime and can be understood initially as the result 
of a perturbation. 



with the presence of the y/~N terms in equation In 
contrast, the curve for the topologically restricted trivial 
loop is very nicely linear. A fit of the form 



p 0l = 0.998 + N/U37 « 1 + 0.18N/N 



(13) 



consistent our estimate, equation where Nq = 255, is 
shown in Figure 01 Note that deviation from the linear 
form occurs as N increases. This is entirely expected 
as the crossover to asymptotic swelling of the gyration 
radius, N 2v /N ~ N 0A9 , must occur as N grows beyond 
No- 



B. Which excluded volume diameter matches most 
closely the topological swelling of trivial knots? 

The cross-over points between curves of trivially knot- 
ted loops and loops with excluded volume in Figure El 
inspired the idea of plotting the excluded volume diam- 
eter at each A" whose swelling matches the swelling of 
a trivial knot at the same N. As seen in Figure 0] this 
mapping parameter seems to approach an asymptote at 
the specific diameter of d = 0.1625. While at present it 
is not computationally feasible to extend the scale of A~ 
to significantly larger values, this asymptotic approach 
of trivial knot swelling to loops with excluded volume is 
consistent with the similar asymptotic swelling of N 2v 
seen in other work |9j, llOl lllj . 

At the same time, it is interesting to note that although 
the swelling parameter due to the excluded volume at 
d « 0.16 seems to fit the topologically driven swelling, 
the corresponding characteristic length N* (see is 
significantly larger than Ao- To see this, we note that 
the excluded volume data in figure El fit reasona bly wel l 
to the expression p m 1 + l.7l\/N{d/lf = 1 + ^N/N*, 
where, therefore, A^* = 0.34(<i/£) 6 . Here, we determined, 
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FIG. 4: The excluded volume bead diameter which gives the 
same (Rg) swelling as the group of trivially knotted loops. 
Unlike other figures in the publication, loops here are gener- 
ated conditionally with gaussian distributed steplength. This 
is done for feasibility reasons, as computationally, gaussian- 
distributed steps are easier to generate than loops of fixed 
steplength. As seen in the image, the excluded volume diam- 
eter seems to saturate at about d — 0.1625. This saturation 
is consistent with the notion of the trivial knot gyration ra- 
dius average approaching the N 2v asymptotic when N S> Ao. 
Although not tested, we expect that fixed steplength loops 
would exhibit similar saturation at a specific excluded vol- 
ume diameter. 



based on the fit, the numerical coefficient intentionally 
left undetermined in formula QJ. At d — 0.16-^, we get, 
therefore, N* ss 20000, which is almost two orders of 
magnitude greater than Ao « 255. Alternatively this sit- 
uation can be seen by finding the excluded volume diame- 
ter for which crossover length N* matches ATq: A^* = Ao; 
the corresponding d equals d w 0.33£ It is fairly obvi- 
ous that this value of excluded volume does not agree 
well with the data presented in figure 01 This discrep- 
ancy possibly points at yet another difference between 
swelling driven by topology and excluded volume. 



V. CONCLUSIONS 

It seems quite clear from our simulation data that the 
analogy between excluded volume and trivial knotting 
does not hold at loop sizes smaller than the crossover for 
knots, Ao- The nature of the swelling function, p(N), 
in this region is yet unknown. Although our cursory 
explanation accounts for the trivial knot data's linear 
trend in this regime, the similar parameter for the size 
of more complex knots behaves non-linearly, and we cur- 
rently have no explanation for this. A more systematic 
treatment of the problem is badly needed to understand 
the size behavior of knots. 

That said, our data showing the mapping of excluded 
volume diameter to trivial knot size seems to reinforce 
the notion that asymptotically, the two classes of objects 
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scale with the same power. 
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APPENDIX A: A BRIEF REVIEW OF LOOP 
GENERATION METHODS 

A number of methods exist and have been used in 
the literature for the computational generation of looped 
polymers. The goal of generation methods is to produce 
statistically representative and unbiased sets of mutually 
uncorrelated loops. The generation of a random walk is a 
simple matter. Steps are chosen with isotropic probabil- 
ity until the desired length is reached. Creating random 
walks with biased probability, specifically, walks which 
return to the origin after a specified number of steps, is 
a more difficult task. As many studies of the topological 
properties of polymer chains have been completed, we do 
not intend to make an exhaustive summary of all work, 
but rather in broad strokes summarize the generation 
methods used in the field. 

All methods used to generate loops can be grouped into 
two large categories. Methods of one group start from 
some loop configuration which does not pretend to be 
random, and then transform it in some way to randomize 
the set of steps making the loop. Methods of the other 
group build more or less random loops from the very 
beginning. 

One of the initial techniques used for the generation 
of loops is the dimerization method of Chen, [26L |27| . in 
which smaller sets of walks are joined end to end to form 
larger walks or loops. This "Ring Dimerization" accepts 
the joining of smaller walks with some probability, as 
self-intersections between the chains are prohibited. In 
addition, if the generated walk is closed to form a loop, 
a statistical weight is calculated to account for loop clo- 
sure. Several groups have used this method, [lTJ l21| - 
usually in the context of including excluded volume in 
the topological study. 

Other workers, [UlUl, start with an initial loop con- 
formation and then modify it by applying a number of 
"elbow" pivot moves on randomly selected sections of 
the loop. Specifically, if the loop is defined by N ver- 
tices, {xi}, a pivot move is performed by selecting two 
vertices, Xj and Xk, and then rotating by a random angle 
the intermediate vertices x^+i through Xk-x about the 
axis made by Xk — Xj . 

A third method in common use, the so-called "hedge- 
hog" method [Icl |28[. starts by generating N/2 pairs of 
mutually opposite bond vectors. The resulting set of N 
vectors has zero sum, and it is tempting reshuffle them 
and then use as bond vectors, thus surely obtaining a 



closed loop. Unfortunately, such a loop has obviously 
correlated segments, the most striking manifestation of 
which is that the loop has self-intersections with a large 
probability of order unity (in fact, 1/e w 0.37, [2^; see 
also a related scaling argument in [Tl| ) - To overcome 
this, Dykhne [2£| suggested imagining all N vectors plot- 
ted from the origin and thus forming something like a 
hedgehog, and then randomly choosing pairs of vectors 
(hedgehog needles), and rotating the pair by a random 
angle about their vector sum. This operation does not 
change the sum of all N vectors, which remains zero, and 
therefore, upon sufficiently many such operations and 
upon reshuffling all vectors, one can hope to obtain a 
well randomized loop. 

The hedgehog method and elbow moves method are 
in fact quite similar. Indeed, in both cases the idea is 
to rotate some bond vectors around their vector sum; in 
the hedgehog method it is done with pairs of vectors be- 
fore reshuffling, in the elbow moves method it is done 
after reshuffling with a set of subsequent bonds, but the 
idea is the same. In both cases, the evolution of loop 
shape can be described by Rouse dynamics, known in 
polymer physics (see, e.g., 0). This allows us to make 
a simple estimate as to how many moves are necessary 
in order to wash away correlations imposed by the initial 
loop configuration. Rouse dynamics can be understood 
as diffusive motion of Fourier modes. Since the longest 
wave Fourier mode has wavelength which scales as N, the 
longest relaxation time in Rouse dynamics scales as A 2 . 
This estimate is valid for physical dynamics in which all 
segments move at the same time. Translated into compu- 
tational language, this implies that every monomer has 
to make about ./V 2 moves, which means that we have to 
make about A 3 random moves for proper removal of cor- 
relations. Unfortunately this point is rarely mentioned 
in the use of these algorithms, (see however, 19]), and 
the number of moves between sampling is generally quite 
small, which puts into question the ergodicity of imple- 
mentations of this algorithm. 

To overcome this problem, we proposed in p"H another 
method which we call the method of triangles, which docs 
not involve any relaxation. In this method, we gener- 
ate A/3 randomly oriented triplets of vectors with zero 
sum, reshuffle them, and connect them head-to-tail, thus 
obtaining a loop. As we shall explain in another pub- 
lication, this method produces loops with insignificant 
correlations when N is larger than a hundred or so. 

Since our major attention in this article is the range of 
relatively small N, we have to resort to a computationally 
more intensive, but reliably unbiased method based on 
conditional probabilities. The idea is to generate step 
number i in the loop of A steps using the conditional 
probability that the given step arrives to a certain point 
provided that after N—i more steps the walk will arrive at 
the origin. This method was suggested and implemented 
for Gaussian chains in 30]. Here, we apply it for the 
loops with fixed step length. 
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APPENDIX B: GENERATION OF LOOPS WITH FIXED STEPLENGTH USING THE CONDITIONAL 

PROBABILITY METHOD 

1. Derivation of the Conditional Probability Method 



A walk is composed of N steps between N + 1 nodes, a step from nodes Xi to Xi+\ having normalized probability, 
g(xi, Xi+i, 1). The probability for a random walk composed of N such steps is described by the Green function which 
ties the steps together, 

(Bl) 



G(xq,xn,N)= / g(xi — xo)g(x2 — xi)...g(xN — XN-i)dxidx2---dxN-i 



Note that in this notation the walk stretches from xq to xn ■ The specifics of integration depend on the sort of steps 
which are being taken. At times, these integrations can be difficult to evaluate. In such cases the convolution theorem 
can be of some utility. Suppose that the Fourier transform and inverse is defined in the usual way, 



9k = 0j9(x)exp 
g{x) = (3 J g^exp 



ik ■ x 
-ik ■ x 



dx 
dk. 



(B2) 



Note that in this formulation (3 = (2n) 3 / 2 . The convolution theorem allows for the following expression for N > 2, 



dk 



(B3) 



G(x ,x N , N) = {1/I3f- 2 J {g z ) N exp \-ik ■ {x N - x ) 
If steplength is fixed to a certain distance, £, the probability distribution and its fourier transform are expressed, 

g(x ,Xi, 1) fixed 



9k = 



(B4) 



kf 



Using equations (|B3|I and l|B4|l . along with differential volume dk — 2nk 2 dkd (cos 8), the probability distribution for 
a walk of N fixed-length steps spanning the displacement x^ — xq is, 



G(x ,x Nl N) fixed = /3 2 47r 



sin [kl] \ N sin [k \x N - x 1] 



io \ k£ J k \xn — xq\ 
If we use the definition of (3 and express Sine terms as exponentials, also using d — \xn — xq \ /£ then, 



I 



G(x ,x N ,N)f ixed = — 



(exp [ik£] — exp [— iki]) (exp [ikld] — exp [— ikid)) 2 



(2ik£) N + 1 d 



k l dk 



(B5) 



(B6) 



Then using the Newton binomial (x + y) N = X) m =o im) xN m y m , where, ( m ) = ( n _^)i m ; ; yields a shiny prize, an 
analytically tractable expression: 



G{x ,x N ,N) fixed - 2N+2iN+1(N+ld J Km 



N\ (exp [ik£]) N - m {- exp [-ik£]) m {exp [ikid] - exp [-ikid]) 



k N ~ x 



dk 



(H7) 

At this point two further simplifications are made. The first is to extend the integration from — oo to oo, as the 
integrand is even on the real axis (with proper incorporation of the factor of 1/2). The second simplification is to 
integrate over the dimensionless number, k — kl. Note that the dimension of the integral remains 1/ 'volume. 



G(x ,x N ,N) fixed 



Nl 



ir 2 2 N +H N + 1 £ 3 d 



N 



^ (-1) exp [w(N - 2m + d)\ - exp [ik(N - 2m - d)] 
2^ ~ — irmrr rvri dn . (B8) 



(N — m)\m\ 

m=0 v ' 



The integral which remains can be evaluated as a contour integral in the complex plane. The contour along the real 
axis is chosen with a small bump in the +i direction at k — 0. The upper or lower arch is chosen according to Jordan's 
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Lemma. The residue at k = is obtained by Taylor expanding the exponent to resolve the coefficient corresponding 
to the k _1 term, which is the definition of a residue. The result follows, 

00 exp [ia K ] ( if a > 



— OO 



Integration winnows the sum considerably, the final result is, 



N(N - I) 

G{x ,x N ,N) fixed = 2 ; +2 ^ 3 J (Ji(N,d) - J 2 (N,d)) , (BIO) 



where 

JV 



Jx(N,d)= 7W M i (N-2m + d) N - 2 , (Bll) 

(TV — to)! to! 



m>(AT+d)/2 

and 

^ (—'\Y n 

m>(N-d)/2 v ' 

A table of probabilities can then be composed. Note however that the probability is defined on intervals over d, listed 
in the right column below. 



(B13) 



G(x ,0,3) fixed = { ^ de [0,2] 

U(X ,U,6)f ixed - j (3_ d )/( 16?r ^3) de [ 1)3 j 

, , _ / (8 - 3d)/(64^ 3 ) de [0,2] 

u^o,u,4j /Med - 1 ( d _ 4 )2/( 647 ^3 d ) de [2,4] 

(5 - d 2 )/(647^ 3 ) de[0,l] 
G(f ,0,5)/«ed = { (2d 3 - 15d 2 + 30d- 5)/(1927rf 3 d) de[l,3] 

-(d- 5) 3 /(3847r^ 3 d) de[3,5] 

(5d 3 - 24d 2 + 96)/(1536tt^ 3 ) d e [0, 2] 

G(f ,0,6)/« ed = <( (-5d 4 + 72d 3 - 360d 2 + 672d- 240)/(30727^ 3 d) de[2,4] . 

(d- 6) 4 /(30727r^ 3 d)) de[4,6] 

These piecewise-defined probability distributions approach the shape of the corresponding quantity for gaussian 
distributed steplength, 



3/2 

G(x ,x N ,N) gaussian = ( 27r ^2 ) exp 



3 



2NP 



(xjv - 2? ) 



(B14) 



Due to the complexity and computational expense of the conditional method, and noting the apparent similarity of 
the two curves, one might be tempted to substitute the Gaussian formulation, equation i|B14fl . when N is above some 
threshold, N > N c . Our own experience with this approximation leads us to discourage the intermingling of the two 
distributions. When included, at even the large iV c = 30, a sharp discontinuity in the curve of curve for poi v s N 
(Figure ©) was visible at N c . We hypothesize that substitution of the Gaussian formulation, equation l|B14|l . for 
the fixed-step formulation, equation (jBlOfl . allows for slightly more inflated loop conformations and thus leads to a 



discontinuity when the approximation is used in the simulat ion code at N > N c . 



2. Implementation of Conditional Probability 
Method 

Generation of a random walk which is looped, i.e. 
x jv — xq = 0, can be achieved with the use of the al- 



ready derived equations. Imagine that a walk of N + M 
steps is underway and M steps have already been taken. 
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taken. Thus the expression above becomes, 



1 Sin[9] 



FIG. 5: This geometry is used in the implementation of the 
conditional probability loop generation method. 



This means that a walk of N steps remains, which starts 
at the present location, xq, and finishes at the starting 
point, xm- The probability distribution for the next step, 
from xq to X\, can then be written, 



P{x \xi) 



G(x a ,xi, 1)G{xx,x n , N - 1) 
G(x ,xn,N) 



(B15) 



In principle one could generate new steps with proba- 
bility isotropic in direction, accepting them with condi- 
tional probability defined by equations (|B15I) and <|B10|) 
or (|B14() . In the interest of efficiency, a better method is 
to generate random steps within these probability distri- 
butions. Now discussed is the way to transform a flat ran- 
dom distribution (that produced by the UNIX math func- 
tion drand48() for example) into the distribution above. 
If the flatly distributed variable is q, ie P(q) = 1 on [0, 1), 
elsewhere, the following equation, with d = x^ — xi, de- 
fines the transform to the conditional distribution above, 
G(x), 

P(Q )d{q ) = / z. tts d(d), 

o Jo G(x ,x N ,N) 

(B16) 

In this statement of normalization, the function of impor- 
tance is f{q), which defines the way the two probability 
distributions are made equal. 

In principle the problem is now solved. A complete set 
of probability distributions for walks of fixed or gaussian 
steplength has been defined, and the the formula which 
maps that distribution to a flat, machine-generated dis- 
tribution has also been expressed. If the form of equa- 
tion (|B16|) is simple enough, meaning relatively small N, 
the integral equation can be solved directly for f(q). In 
practice however, N > 5 is an interesting regime and a 
different technique must be used to obtain f{q). 

For the case of finishing a random walk of fixed length 
steps, £, which is x away from the ending point, and has 
N steps alloted to get to that point, we use the geom- 
etry shown in figure JSJ). In this diagram x p is the new 
distance away from the endpoint after the present step is 



P(q')dq> = 



f(q) G(T,l)G(x- p ,N-l) 
G(x, N) 



d(x p ) , 



(B17) 

where, for convenience, the following syntax is used, 
G(b,0,N) = G(b,N). 

Of course the single step G(d, 1) is a delta-function, 
S(\d\ — £)/4n£ 2 , so the integration over d{x p ) occurs over 
most or all of the spherical shell created by the possible 
orientations of £. Integration over the shell (about the 
axis made by x) is performed in "rings," each ring having 
circumference 2-k£ sin[0], and width, £d(9), with resulting 
differential area, dA — 2ir£ 2 sm[9]d(9). 9 is integrated 
over the range, [0, it]. 



It should be apparent that, x p 



2x£cos[9]. 



This yields the differential transform, s'm[9]d(9) — 
(xp/x£)d(x p ). This simplification allows the integration 
of equation l|B17() in the following way, 



P{q)dq' 



2£xG{x, N) 



/(«) 

G(x p , N - l)x p d(x p ), 

lin 

(B18) 

This expression is normalized to 1 if integrated over ap- 
propriate x p bounds. In most cases, those bounds are 
[x — £, x + £], although the physical limit on the upper 
bound, x p < (AT — 1)£ is necessary to keep the walk from 
straying too far from the origin. Additionally, if the walk 
is very close to the origin, x < £, the integration bounds, 
[£ + x, £ — x], are used. 

As Equation (|B12|) for fixed steplength probability is 
defined as a polynomial, integration of that polynomial, 
described by Equation l|B18|l . can be performed exactly 
within simulation computer code, and the resulting equa- 
tion for f(q) solved numerically. In practice we use the 
Gnu Multiple Precision library to represent the polyno- 
mial coefficients and values as rational numbers. From 
a computational standpoint this is significantly more ex- 
pensive than representing coefficients as double floating 
point, but using rationals allows us to represent all out- 
puts of the polynomial with great accuracy, the goal of 
this simulation method. At a relatively small number of 
steps the coefficients become quite small, for example at 
N = 15, in the region x £ [13, 15], equation (|B 12|> reads, 

40809403l>i488 1 o 3 (^d J ) ■ We feel the need in this routine to re- 
tain accuracy when performing operations such as P — Q, 
where P > 1 and Q > 1 but (P - Q) < P, Q. In or- 
der to retain the accuracy of the conditional formulation 
it was imperative to perform this rational number alge- 
bra. For the interested reader we provide a table of these 
polynomial coefficients as supplementary materials. 
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I. SUPPLEMENTARY MATERIALS 
A. Polynomial Description 

In the associated work, we derive expressions for the 
conditional probability of a random walk, G(d£,n)fi xe d, 
with end to end distance di from n steps of fixed length 
I, In these supplementary materials, we present these 
probabilities in polynomial form, making them computa- 
tionally more accessible. 

First, note that the sum indices in Equations Bll and 
B12 lead to a piece-wise definition of the functions (this 
is seen in Equation B13). The function, G(d£,n)fi xe d, is 
defined on (n+l)/2 intervals if n is odd, and n/2 intervals 
if n is even. All intervals span distance in d of 2, with 
exception of the first interval for odd n, d £ [0, 1]. 

Use of a computer algebra system allows for the ex- 
pansion of equation BIO into the form, 

where the set of coefficients are rational numbers. Note 



however that there will be roughly n/2 such sums for each 
n as described in the previous paragraph. It is therefore 
useful to think of the sets of coefficients as list elements, 
a n,j.ii where n refers to the number of steps in the walk, 
j refers to the interval in d of definition, and i to the 
specific power of d in the polynomial. 

Polynomials are defined in the attached fractions32.txt 
and fractionsl01.txt coefficient files on the interval of n £ 
[2,n c ], in this case n c = 32 and n c — 101 respectively. 
The first element in each file is n c . 

Subsequent file elements are the a n jj coefficients, de- 
limited by commas, and provided as rational numbers. 
Coefficients are listed by a nested iteration, first by step 
number n, then by interval j, finally by polynomial index 
i. Were n c = 3, the file would read: 



3 , 1/8 , , 1/8 , 3/16, -1/16 

' lc a n = 2 ,j = 0,i = °™ = 3,3 = 0,i = o n =3,J=0,l=l H B =3,,'=l,i 

(2) 



